Optimal distance separating halfspace ∗
نویسندگان
چکیده
One recently proposed criterion to separate two datasets in discriminant analysis, is to use a hyperplane which minimises the sum of distances to it from all the misclassified data points. Here all distances are supposed to be measured by way of some fixed norm, while misclassification means lying on the wrong side of the hyperplane, or rather in the wrong halfspace. In this paper we study the problem of determining such an optimal halfspace. In dimension d, we prove that there always exists an optimal separating halfspace passing through d affinely independent data points. This directly shows that the problem is polynomially solvable in fixed dimension by an algorithm of O(n). If a different norm or gauge is used for each dataset in order to measure distances to the hyperplane, or if all distances are measured by a fixed (asymmetric) gauge, then one can still show that there always exists an optimal separating halfspace passing through d− 1 affinely independent data points. The one-dimensional problem is extremely easy to solve: it suffices to find a balancing separating point, i.e. yielding an equal number (or weight) of misclassifieds for each dataset. It also follows that in any dimension any optimal separating halfspace always balances the misclassified points, where the balancing criterion now takes the shape of the used gauges into account.
منابع مشابه
Optimal Expected-Distance Separating Halfspace
One recently proposed criterion to separate two datasets in discriminant analysis, is to use a hyperplane which minimises the sum of distances to it from all the misclassified data points. Here all distances are supposed to be measured by way of some fixed norm, while misclassification means lying on the wrong side of the hyperplane, or rather in the wrong halfspace. In this paper we study the ...
متن کاملUnified Distance Formulas for Halfspace Fog
In many real-time rendering applications, it is necessary to model a fog volume that is bounded by a single plane but is otherwise infinite in extent. This paper presents unified formulas that provide the correct distance traveled through a fog halfspace for all possible camera and surface point locations. Such formulas effectively remove the need to code for multiple cases separately, thereby ...
متن کاملAn optimization problem on the sphere
We prove existence and uniqueness of the minimizer for the average geodesic distance to the points of a geodesically convex set on the sphere. This implies a corresponding existence and uniqueness result for an optimal algorithm for halfspace learning, when data and target functions are drawn from the uniform distribution.
متن کاملOptimal halfspace range reporting in three dimensions
We give the first optimal solution to a standard problem in computational geometry: three-dimensional halfspace range reporting. We show that n points in 3-d can be stored in a linear-space data structure so that all k points inside a query halfspace can be reported in O(log n + k) time. The data structure can be built in O(n logn) expected time. The previous methods with optimal query time req...
متن کاملApproximate range searching in higher dimension
Applying standard dimensionality reduction techniques, we show how to perform approximate range searching in higher dimension while avoiding the curse of dimensionality. Given points in a unit ball in , an approximate halfspace range query counts (or reports) the points in a query halfspace; the qualifier “approximate” indicates that points within distance of the boundary of the halfspace might...
متن کامل